A pronunciation lexicon for Turkish based on two-level morphology
Abstract
This paper describes the implementation of a full-scale pronunciation lexicon for Turkish based on a two-level morphological analyzer. For each word form, the system outputs a parallel representation of the pronunciation and the morphological analysis, so that morphological disambiguation can be used to disambiguate pronunciation when necessary. The pronunciation representation is based on the SAMPA standard and also encodes the position of the primary stress. Computing the position of the primary stress depends on the interplay between any exceptional stress in root words and the stress properties of certain morphemes, and therefore requires a full morphological analysis. The system has been implemented using the XRCE Finite State Toolkit.
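To make the stress computation more concrete, here is a minimal Python sketch of how root-level exceptions and morpheme-level stress properties might interact. It is not the paper's rule set or its finite-state implementation. The `Morpheme` class, its fields, and the three-way rule (exceptional root stress wins, otherwise a pre-stressing suffix places stress on the preceding syllable, otherwise stress is word-final) are simplifying assumptions made for illustration only.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Morpheme:
    """Hypothetical morpheme record; field names are illustrative assumptions."""
    syllables: int                      # syllables this morpheme contributes
    root_stress: Optional[int] = None   # 0-based syllable of exceptional root stress
    prestressing: bool = False          # stress falls on the syllable before it


def primary_stress(morphemes: List[Morpheme]) -> int:
    """Return the 0-based index of the stressed syllable in the whole word.

    Simplified assumption, applied in order:
      1. An exceptionally stressed root keeps its stress.
      2. Otherwise the first pre-stressing suffix puts stress on the
         syllable immediately before it.
      3. Otherwise stress falls on the final syllable.
    """
    total = sum(m.syllables for m in morphemes)
    if total == 0:
        raise ValueError("word has no syllables")

    root = morphemes[0]
    if root.root_stress is not None:
        return root.root_stress

    offset = 0  # number of syllables seen before the current morpheme
    for m in morphemes:
        if m.prestressing and offset > 0:
            return offset - 1
        offset += m.syllables

    return total - 1


# kitap-lar: regular root plus plural suffix, stress is word-final
regular = primary_stress([Morpheme(2), Morpheme(1)])
# gel-me-di: the negative suffix is treated as pre-stressing
negated = primary_stress([Morpheme(1), Morpheme(1, prestressing=True), Morpheme(1)])
# Ankara-dan: root marked with exceptional initial stress
place_name = primary_stress([Morpheme(3, root_stress=0), Morpheme(1)])
```

The sketch takes an already segmented word as input, which mirrors the abstract's point that stress placement cannot be decided from the surface string alone and needs the morphological analysis first.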
Similar resources
The architecture and the implementation of a finite state pronunciation lexicon for Turkish
This paper describes the architecture and the implementation of a full-scale pronunciation lexicon for Turkish using finite state technology. The system produces at its output, a parallel representation of the pronunciation and the morphological analysis of the word form so that morphological disambiguation can be used to disambiguate pronunciation. The pronunciation representation is based on ...
The architecture and the implementation of a finite state pronunciation lexicon for Turkish
This paper describes the architecture and the implementation of a full-scale pronunciation lexicon for Turkish using finite state technology. The system produces at its output, a parallel representation of the pronunciation and the morphological analysis of the word form so that further disambiguation processes can be used to disambiguate pronunciation. The pronunciation representation is based...
A Finite State Pronunciation Lexicon for Turkish
This paper describes the implementation of a full-scale pronunciation lexicon for Turkish using finite state technology. The system produces at its output, a parallel representation of the pronunciation and the morphological analysis of the word form so that morphological disambiguation can be used to disambiguate pronunciation. The pronunciation representation is based on the SAMPA standard an...
Grapheme-to-phoneme Conv Morphologica
This paper presents a new approach for grapheme-to-phoneme conversion based on morphology. With this approach, high accuracy can be obtained, although a transcription is not produced for every word. The principle of this approach is to automatically decompose an existing pronunciation lexicon into morpheme-like units called pseudo-morphological units. The pronunciation of the pseudo-morphol...
Computer-assisted Learning of Turkish Morphology Draft 1.0; November 8; for Comments
We describe the design objectives, features, and the computational language model of a computer-mediated tool designed for learners of Turkish morphology. The underlying system is a generative grammar, more specifically, a computational word grammar that makes use of feature structures to deliver composition and decomposition of morphemes at the syntax-lexicon interface via two-level morphology....
Publication date: 2003